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Abstract. The manuscript presents a theoretical proof in conglomeration with new definitions 
on Inaccessibility and Inside for a point S related to a simple or self intersecting polygon T. The 
proposed analytical solution depicts a novel way of solving the point in polygon problem by employing 
the properties of epigraphs and hypographs, explicitly. Contrary to the ambiguous solutions given 
by the cross over for the simple and self intersecting polygons and the solution of a point being 
multi-ply inside a self intersecting polygon given by the winding number rule, the current solution 
gives unambiguous and singular result for both kinds of polygons. Finally, the current theoretical 
solution proves to be mathematically correct for simple and self intersecting polygons. 
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1. Introduction. Given a polygon V or the vertices of the polygon, say (x±, yi), 
0^2) 2/2) ■■■ (x n , Vn)i it is desired to know whether a sample point S (xq, yo) lies within 
V . The status of a point S related to a polygon V being termed as inside needs to be 
defined correctly. The definition is very important in order to retreive unambiguous 
results for not only simple but intersecting polygons also in a 2D Cartesian plane. 
In this manuscript, new definition of inaccessibility and inside has been proposed in 
order to accurately specify the meaning behind the inclusion of a point within or 
without a polygon. 

Cross Over ([3], [5], [5J, [7]) states that if a semi infinite line drawn from S cuts 
the V odd number of times, then the point is inside the polygon. Three issues arise 
in this case, i.e. • depending on the orientation of the line from the query point, 
odd or even values can be obtained, if the line passes through vertices. This gives rise 
to ambiguous results for the same point with different rays at different orientation. 
A prevalent solution is the shifiting of the ray infinitesimally, but then again solution 
may change drastically depending on the direction of the shift. Even though this 
may be a rare case with negligible chance of occurance, the issue persists, leading to 
ambiguous results. • A second issue is that of repeating the cross over multiple times 
until the point lies inside the polygon. This leads to non determinism as it is not 
known how many times the rays need to be shot to get an affirmative answer, if ever 
it is conducted. • In case of intersecting polygons, areas exist which give a different 
solution than the winding number rule concept. 

Winding Number Rule ([7], [3], PQ) states that the number of times one loops 
around S while traversing V before reaching the starting point on the polygon shows 
whether the point is inside the polygon or not. So a number £ greater than one can 
mean that the point is £ times inside the polygon. This is an issue because if a point 
lies inside a polygon once, it lies forever. Thus £ > 1 depicts the idea of redundancy. 

As will be explained later in detail, the current solution looks at these problems 
afflicting the status of point related to polygon from a different perspective. The 
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manuscript defines the concepts of inaccessibility and inside of a polygon V while 
relating to the query point S. The proposed solution is motivated from [5] but has 
a major difference in using S as a reference point to draw a line chain through it, 
that cuts the polygon at different intersection points. The following section ^ shed 
light on novel algorithm explained with the assumptions involved. Next a theoretical 
analysis of the solution is given in [3] A detailed comparison with the crossover and the 
winding number with the proposed algorithm is made in sections [4] and [5j Finally, 
the conclusion is reached in section [6] For detailed analysis of the experimental 
results and time complexity of the algorithm please see the appendix below or visit 
|http://arxiv.org/abs/1010.0552| 

2. A Novel Algorithm. The novel algorithm in simple terms can be described 
as follows. Given the sample point (xq, yo), a horizontal line y — yo is drawn through 
S to cut the V at q locations {(x™ 1 , yo) , ■■■ , { x q lt ,Uo) }, thus breaking the polygon 
into q chains. 

Definition 2.1. A chain is a series of connected edges of the polygon whose 
starting and ending points lie on the horizontal straight line that passes through S. 
Mathematically, a chain is a function f c , with a closed domain defined by the starting 
and ending points on the horizontal line passing through the point of test S, and a 
range that is the graph of the currently under investigation connected edges of the 
polygon . 

Each chain is then checked for whether its two endpoints contain the test point 
between them; if not, the chain is discarded. Discarded chains are termed as invalid 
chains and those kept for further consideration are referred to as valid chains. The 
remaining chains are then tested for intersection with a vertical line x = xq through 
S. The intersections found are sorted by height, and paired up. If the test point is 
not between a pair, it is outside. This criterion of containment is checked via the 
definitions of affine sets and afhnc combination as follows: 

Definition 2.2. A set T C 7Z n is an affine set, if for any two points x i: Xj £ 
T(j > i) and 9 G [0, 1], Ox, + (1 - 6) Xj G T. 

Definition 2.3. An affine combination of Xi,Xj G TZ are a set of points of the 
the form QiXi + OjXj, where 9{ + 0j = 1. 

These definitions and notations and a few others, are adopted from [5]. It is 
assumed that the vertices of the polygon V are arranged in order of traversal, starting 
from one of the vertices. The traversal order can be in any one direction. Another 
assumption is that the edges are traversed only once. This is useful in avoiding 
multiple loops that may occur in cases of intersecting polygons. 

If S lies out of the bounding box of the polygon, it is considered outside V and 
no further processing is done. Lastly, if the sample point is one of the vertices of the 
polygon, then it is considered to be in the polygon. This final point is assumed as the 
proposed solution would reach the same conclusion at the expense of computational 
time. In the theoretical proof, it will be shown that the assumption for implementation 
issue is correct. 

As the algorithm is explained the concepts of epigraph and hypographs will also 
be used for providing an analytically complete elucidation of the generated solution. 
The definition of these are as follows: 

Definition 2.4. The epigraph of a function (chain) f c : 7Z n — » TZ is a set of 
points that lie on or above the graph under consideration, such that epi(f c ) = {(x,t) : 

xen n ,te n, f c (x) <t} is a subset ofn n+1 . 

Definition 2.5. The hypograph of a function (chain) f c : TZ n — ^ TZ is a set 
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of points that lie on or below the graph under consideration, such that hypo{f c ) = 
{(x,t) : x £ K n ,t G UJ c {x) >t} is a subset ofK n+1 . 

Finally, to decide if the point lies inside or is inaccessible with respect to a polygon 
under consideration, the definition of nearest chains would be needed. This definition 
is as follows: 

Definition 2.6. Chains Ci andCj are nearest valid chains if: 

• the epi{fd) C epi(fc u ) ^ u G !>•••> 1 chains below S such that (xo;2/o) € 

• the hypo(fc j ) C hypo(fc v ) Vw € q chains above S such that (xo,ya) € 



3. Inaccessibility-Inside Theorem. Given the new solution, it becomes im- 
perative to prove the correctness of the solution. This follows due to the fact that 
definitions like the cross over and the winding number rule exist that state the meaning 
of inside from different perspectives, thus giving contradictory results. New definitions 
of inside and inaccessibility of a point S related to polygon V are proposed and a 
relation between inaccessibility and inside is proved. This proof shows that consistent 
results can be obtained if the meaning of the inaccessibility and inside of a polygon 
related to a point is are framed correctly in an abstract sense. 

It must be noted that the points that lie on vertices and edges are special cases 
and the definitions of inside and inaccessibility get slightly modified. But this does 
not mean that the meaning of inaccessibility and inside get twisted or modified from 
an abstract sense. Two cases are presented, one that is a point lying on a vertex and 
the other for the general case where it lies either on the edge or anywhere else. 

3.1. Point on Vertex of Polygon. The definitions of inaccessibility and inside 
are proposed for the case of a point lying on a vertex. The essence of abstract meaning 
of the same gets carried over to points not on vertex also but the definitions are slgihtly 
modified. 

Definition 3.1. The inaccessibility Inacc-p(S) of a point S related to a polygon 
V , is the number of valid chains that need to be broken. 



Definition 3.2. The status of a point S related to a polygon V , that is Inside-p{S), 
is the existance of a chain C such that S G epi(fc) or S £ hypo(fc). 



Based on these two definitions, two throrems need to be proved regarding the 
relationship of inaccessibility of a point as well as the status of a point whether it is 
inside with respect to the polygon. 

Theorem 3.3. A point S related to polygon V is inside as well as inaccessible 
when: Inside-p(S) € {1} iff Inacc-p{S) £ {A/"} 

Proof, (a) If Inacc v {S) € {TV} then Inside r (S) € {1} 

Given that Inacc-p(S) — A/", there exists M valid chains that need to be bro- 
ken according to definition |3.1| It is known that a chain is valid when either its 
epigraph or hypograph contains S. This existance of M valid chains imply that 



hypo(fcv)- 




M, J\f valid chains to be broken 
0, otherwise 
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S G {epi(fc k ),hypo(fc k )} Vfc G {1, ...,7V}. But this is the definition of stauts of S 
related to V, i.e. Inside-p(S) = 1 or Inside-p(S) G {1}. 

(b) If Inside v {S) G {1} then Inacc v (S) G {TV} 

Given Inside-p(S) = 1 implies that S G {epiifc): hypo(fc)} for a chain C in 
/. Thus chain C is a valid chain, as it contains the point S. In order for S to be 
inaccessible, there must exist atleast 1 vaild chain in V that needs to be broken. Since 
C is one such chain and the only chain that contains 5, the inaccessibility order of S 
related to V in Inacc-p{S) = 1 or Inacc-p(S) G {1}. 

If S is a vertex such that it is an intersection point of two or more lines of a 
polygon, then all chains that have their epigraph or hypograph contain S, are valid. 
Since it requires Af (if Af is the number of valid chains) chains to be broken. □ Cases 
need to be shown pictorially to get a feel of what the theorem is suggesting about. 
Figure |3.1a| shows three different polygons with S as the point under consideration. 



The polygon in figure 3.1a (A) has four chains that contain S namely (a) ST\J (b) 



UVS (c) SWX and (d) XYS, which are valid. Thus by theorem [373| I nside v (S) = 1 
and Inacc-p(S) = 4. Thus S lies inside the polygon. Similarly, for figure 3.1a (B) there 
is one chain 5TU5 which is valid as it contains the point S. Thus Inside-p{S) — 1 and 
Inacc-p{S) = 4. For the case of figure 3.1a| (C) there exists two chains that contain S 



i.e. (a) STUS and which are valid. So Inside-p(S) = 1 and Inacc-p{S) = 2. 

Note that since this holds true always when S lies on the vertex of a polygon, it 
is obvious and correct to assume that the point is in the polygon by first checking if 
iS is any one of the vertices in the polygon. This helps to avoid the implementation 
hurdle of checking the theorem. But again it is stressed that first the point needs to 
be checked against vertices of the polygon, in order to know if they belong to V . 

Theorem 3.4. A point S related to polygon V is not inside as well as not 
inaccessible when: Inside-p(S) G {0} iff Inacc-p(S) G {0} 

Proof, (a) If Inacc v (S) G {0} then Inside v (S) G {0} 

Given that Inacc-p(S) = 0, there exists no valid chains that need to be broken 
according to definition [3T] This means that S ^ {epi(/c fe ), hypo(fc k )} VA: valid chains 
in V. Since no chain exists whoes epigraph or hypograph contains 5, the status of S 
related tp V is Inside-p(S) = or Inside-p(S) G {0}. 

(b) If Inside v (S) G {0} then Inacc v (S) G {0} 

Given Inside-p{S) = implies that S {epi(fc k )ihypo(fc k )} Vfc chains in V . 
This means no valid chains exist in V that need to be broken. Thus the inaccessibility 
of S related to V is zero, i.e. Inacc-p(S) G {0}, which is the desired result. □ Cases for 
theorem |3.4| are simple and depicted in figure |3.1b| Figure |3.1b| shows two different 
polygons with S as the point under consideration. The polygon in figure 3.1b| (A) has 



four chains that do not contain S namely (a) RTU (b) UVR (c) RWX and (d) XYR, 
which are invalid. Thus by theorem |3.4| Inside-p(S) — and Inacc-p(S) = 0. Thus 



S lies outside the polygon. Similarly, for figure 3.1b (B) there exists two chains that 
do not contain S i.e. (a) RTUR and RVWR, which are invalid. So Inside-p(S) = 
and Inacc-p(S) = 0. 

3.2. Point not on Vertex of Polygon. Now for the general case of point in 
polygon, the definition of inaccessibility and inside evolve slightly while preserving the 
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(a) 5 as a vertex point (b) 5 not on the vertex 

Fig. 3.1: Polygons with locations of the point S. 



abstract essence of the idea. Again the theorems will be proved which find a relation 
between when a point is inaccessibile as well as inside the polygon. 

Definition 3.5. The inaccessibility Inacc-p{S) of a point S related to a polygon 
V , is the number of valid chains that need to be broken and/or ignored. 



Inacc-p (S ) 



1, a pair of chains need to be broken 

Af, Af 7^ pairs of chains to be ignored 

1 + Af , a pair to be broken and 
Af pairs to be ignored 



Definition 3.6. The status of a points related to a polygonV , that is Inside-p{S) , 
is the existance of a pair of chains Ci andCj such thatS £ epi(fd) andS £ hypo{fc j )- 

!1 , pairs of chains Ci and Cj , 
s.t. S € epi(fci) and S £ hypo(fc j ) 
0, otherwise 

Again the relation between inaccessibility and inside of a polygon is proved via two 
theorems. The theorems are as follows: 

Theorem 3.7. A point S related to a polygon V is inside as well as inaccessible 
when: Inside-p(S) £ {1} iff Inacc-p (S) € {1, 1 + J\f} 

Proof, (a) If Inacc v (S) £ {1, 1 +Af} then Inside v {S) £ {!}. 

Given Inacc-p (S) £ {1, 1 + Af} implies that there exist a pair of valid chain in 
V that need to be broken and/or M pairs of valid chains that need to be ignored. 
A valid chain by definition is one whoes epigraph or hypograph contains S. Taking 
the general case of 1 + M (if M = 0, 1 + M collapses to 1), there are 2 x (1 + Af) 
valid chains such that half lie above/on S and the rest half lie below/on S. If a 
vertical line passing through x = x Q is drawn such that it cuts the valid chains and S, 
then the chains can be sorted according to the value of intersection points in x = xq. 
Let C\ C2x(i+A0-i' ^2x(i+a0 be the sorted order of chains from bottom to top. 
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Taking consecutive pairs of these valid chains i.e. (C\, C-i), (C3, C4), (Cj, C;+i), 
(C 2x (i+a/')-1i C 2x (i+a/'))i it is easy to know whether S is an affine combination of 
(x ,y^) and (x ,ycf +1 ), V e {1, 3, 5, 2 x (1 +7V) - 1}. Here ygf and y^ +i are the 
intersection points on the chains k and k + 1 due to the line 2; = xo- 

Since it is known that atleast one pair of chains need to be broken, a pair of 
points (x , Ucf) and (x , Vcf +1 ) exists for which S (x , yo) = {xo, OVc? + (1 - %c"+i) 
for < 6 < 1. This implies that Cj is the nearest chain below/on S and Cj +1 is the 
nearest chain above/on 5, otherwise 5 won't be an afiine combination of (xoij/c ) 
and (^0)2/c"+i)- ^ or * ne res ^ °^ ^ ne ^ Pahs, since S is not an affine combination of 
(xo^ycf ) and i x o,Vc? +1 ) Vfc e {1,3, ...,2x (1 + AT) - 1} - {«}, these TV pairs of chains 
can be ignored from further processing or consideration. Since these nearest chains Cj 
and Ci + i are also valid, their epigraph and hypograph contain S, respectively. This 
existence of a pair of a valid chains which has to be broken such that S € epi(fc t ) 
and S £ hypo{fc i+1 ) implies Inside-p(S) = 1, the status of S related to V . 



(b) If Inside v {S) £ {1} then Inacc v (S) £ {1, 1 + Af} 

Let P be a polygon such that Inside-p(S) — 1 implies there exists a pair of chains 
Ci and Cj such that S £ epi(fd) and S £ hypo(fc j )- Given only these two chains, it is 
evident that both of them are nearest chins to S. Let the starting and ending points 
of d and Cj be {(x^*,yo), (x%* ,y )} and {(2^ , y ), (a£™< ,y )}, respectively. If a 
vertical line x = x$ is drawn through (xq, yo) it would interest the chains Ci and Cj at 
an d (^0,2/c™*)' respectively Since (a:o,2/c™*) ii es below (x ,yo) and (xo,yc™*) 
lies above (#0,2/0)1 <5 is an affine combination of (^o,?/™*) and (xo,?/™*)- Thus 5 lies 
between and Cj. Now, if an end point of is joined with an end point of Cj and 
another end point of the former joined to the remaining end point of the latter, then 
a closed loop is formed such that traversing once from any one point, lead to the same 
point in the end. Let this loop be V . 

As long as the winding number of V is around S is same as that of V around S, 
V can be deformed into V, by Hopf's degree theorem [5], Since V is formed from the 
two necessary chains Ci and Cj which are valid, a pair exists in polygon V that needs 
to be broken. Thus the minimum inaccessibility of S related to V is Inacc-p(S) = 1. If 
there exists extra pairs of valid chains, then they would be ignored from consideration, 
while checking for the affine combination criteria of S with respect to the sorted pairs 
of chains on x — xq. If Af is the minimum number of pairs of valid chains that are 
ignored, then the inaccessibility of S related to V is Inaccp-p(S) = 1 +Af. Thus 



Inacc-p(S) — {1, 1 + Af}. □ Now, the cases of theorem 3.7 are presented with visual 



representations in figures 3.2a and 3.2b In figure 3.2a (A), 3 chains exist of which two 



are valid. The valid chains are (a) UV and (b) WU. The chain VW is invalid. Thus 



Inside-p(S) — 1 and Inacc-p(S) = 1. In figure 3.2a (B) the point lies on the edge and 
the polygon can be divided into 3 chains of which only two contain S. These valid 
chains are (a) SU and (b) VS. Thus Inside-p(S) = 1 and Inacc-p(S) = 1. In part (C) 
of figure |3.2a[ the horizontal line cuts through two edges and touches two vertices. 
Thus there exists 4 chains of which two are valid, namely (a) UV and (b) XU which 
contain S. Thus Inside-p{S) = 1 and Inacc-p(S) = 1. Lastly, in part (D) of the same 
figure, S lies on an edge and the horizontal line passes through a vertex. In this case, 
there exist 5 chains of which two contain the point of test and are thus valid. They 
are (a) VW and (b) YU. Thus Inside-p(S) = 1 and Inacc-p(S) = 1. 



Next, in figure |3. 2b (A) 8 chains exist namely, (a) YZ (b) ZW (c) WB (d) BU (e) 
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(a) S not on the vertex (b) S not on the vertex 

Fig. 3.2: Polygons with locations of the point S. 



UV (f ) VA (g) AX and (h) XY, of which 6 are valid except UV and XY. These two do 
not contain S. Now, the pair that needs to be broken is YZ and ZW, while the pairs 
(WB, AX) and (BU, VA) are to be ignored from consideration. Thus, Inside-p(S) = 1 
and Inacc-p (S) = 1 + 2 (i.e one pair requires to be broken and two need to be ignored 
from consideration). Finally, in figure 3.2b (B) two chains exist of which both are 
valid, i.e. (a) UV and (b) VWWU. Thus Insider (S) = 1 and Inacc v (S) = 1. 

Theorem 3.8. A point S related to a polygon V is not inside as well as inacces- 
sible when: Inside-p(S) G {0} iff Inacc-p {S) G {A/ - } 

Proof, (a) If Inacc v {S) G {Af} then Inside v (S) G {0}. 

Given Inacc-p (S) = Af implies pairs of valid chains need to be ignored. Again, a 
valid chain by definition is one whoes epigraph and hypograph contain S. If a vertical 
line passing through x — x is drawn such that it cuts the valid chains and S, then 
the chains can be sorted according to the value of intersection points in x = xq. Let 
Ci,...,C2xJV be the sorted order of chains from bottom to top. Taking consecutive 
pairs of these valid chains, i.e. (Ci, C 2 ), (C 3 , C 4 ),..., (Cj, Cj), (C 2x jV-i, C 2x aa), 
it is easy to know whether S is an affine combination of (xq, Vcf) an d (%q, y l cl ) 
Vfc € {1, 3, 2 x JV — 1}. Since Af pairs need to be ignored, it is evident that S is 
not an affine combination of any of the above pairs. This suggests that there does not 
exist a pair such that S € epi(fc k ) and S G hypo(fc k+1 ), that can be broken. Since no 
such pair exists, the status of S related to V is Inside-p(S) = 0, which is the desired 
result. 



(b) If Inside-p (S) G {0} then Inacc v (S) G {Af}. 



Given Inside-p(S) — implies there does not exist a pair of chains Ci and Cj 
such that S G epi(fc i ) and S G hypo(fc j ). Thus it is difficult to proceed with the 
proof. Instead, by proving its contrapositive the above statement will hold. Thus 
if Inacc-p (S) ^ {Af} then Inside-p(S) ^ {0}. Since Inacc-p (S) £ {Af}, it im- 
plies that Inacc-p{S) G {1,1 + Af}. It has been proved in part (a) above that if 
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Inacc-p(S) £ {1,1 + A/"} then Inside-p(S) £ {1}. But Inside-p(S) £ {1} also means 
that Inside-p(S) ^ {0}. Thus Inacc-p(S) £ {Af} implies that Inside-p(S) ^ {0}. 
Since the contrapositive holds, so does the original statement. □ Finally, two cases for 



theorem 3.8 are depicted in figure 3.3 In figure |3.3| (A) 8 chains exist namely, (a) UV 
(b) VA (c) AX (d) XY (e) YZ (f) ZW (g) WB and (h) BU. Of these, 3 pairs exist, 
all of which are valid chains but need to be ignored as non of them enclose S as an 
affine combination of two intersection points on x = xq. These pairs are (WB, AX), 
(XY, ZW) and (VA, BU). Thus Inside v (S) = and Inacc v (S) = 3. For the case of 



intersecting polygon in figure 3.3 6 chains exist namely, (a) ZU (b) UY (c) YW (d) 
WX (e) VX and (f) XZ, of which WY and XZ are not valid. Remaining pairs of valid 
chains need to be ignored and thus Inside-p{S) = and Inacc-p(S) = 2. 

For the next few sections, let EH (epi/hypo-graph method) denote the proposed 
method. 

4. Crossover vs EH. Crossover (Cr) states that a line drawn from a point S 
in a direction, if it cuts the polygon V odd number of times, implies that S is inside 
V, i.e. 



Inside% R (S) = 



1, odd intersections 
0, even intersections 



For the case of vertices, the problem is solved by shifting the line infinitesimally. Two 
issues arise when a line is shot from S and it intersects a vertex. (1) There can be 
two solutions, if the line is not shifted slightly. (2) If the crossover has to be repeated 
several times until it finds an odd number of intersections, then it is a nondeterministic 
problem, in case the line is shot randomly. 

By (1) ambiguity arises on the way a ray or line is shot from S and by (2) 
nondctcrminism arises due to repeatedness because of the line being shot randomly. 
The following figures will illustrate these issues in detail. In contrast to the Cr, by 
checking through theorems |3.3| and |3.7| the EH method can easily determine if S lies 
in V or not, deterministically. This is because whichever way a line is drawn through 
iS, if it cuts the polygon, then it will dismember V into a finite number of countable 
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chains. If it doesn't cut the polygon and 5 is a vertex, then also there exist atleast 
one chain that contains 5. Searching for these valid chains and then locating which 
of those need to be broken is deterministic. 



The figure 5.1a shows the different cases under consideration for the comparison 
of CR and EH method for the same point of investigation. In figure 5.1a (A), if the 



a horizontal line is drawn to the left of the 5, then it intersects at two points U and 
V and if it drawn to the right, it intersects at the point W. According to CR, when 
the line is drawn to the right of the 5, then 5 is inside the polygon. If the line is 
drawn to the left of 5, then the point is outside the polygon. This is definitely a case 
of ambiguity. Also, the outcome of the CR depends on the direction of the ray that is 
shot from 5. This makes the outcome of the test nondeterministic in the sense that 
it is not known which ray would give the correct result, if the rays are shot randomly. 

The EH method overcomes this problem by segmenting the polygon into finitely 
countable chains. The searching for an affine combination of valid chains that may 
contain 5 is deterministic as there are only limited number of chains available for 
checking. Thus the outcome of the EH method is final and deterministic. If two 
perpendicular rays with its intersection point as 5 are drawn at a different orientation, 
thus intersecting the polygon at different places, even then by rotating the oriented 
axis and the polygon to horizontal vertical frame, the solution remains the same. Thus 
randomness of the ray s do n ot affect the outcome of the point inclusion test for 5. For 
part (A) in the figure 5.1a by CR Inside% R (S) — (0, 1) depending on the number of 
intersections that is (2,1). By EH method, Inside® 11 (5) = 1 and Inacc® 11 (S) = 3 
by theorem |3.3| It must be noted that the inaccessibility of the point related to the 



polygon may change but the status of 5 related to V captured by the definition of 
Inside will not change if the point is inside th e poly gon. 

by CR the Inside% R (S) = (1, 0) 



Similarly, for the part (B) and (C) in figure 



5.1a 



depending on the number of intersections based on the direction of the ray which is 



(1, 2). Finally, in figure 5.1a (D), for point Si four valid chains exist namely, (a) VW 
(b) XU (c) U i>2 and (d) SJV, none of which need to be broken or ignored. Thus by 
theorem 



3.7 



Inside® 11 (Si) = and Inacc® H (Si) = 2. By CR the outcome of the 
inclusion test changes, that is Inside^ R (Si) = (0, 1) depending on the intersections 
obtained by the direction of the ray that is (2, 3). For the p oint 6>2, two valid chains 
exist namely (a) 5 2 V and (b) VWXU5 2 . Thus by theorem |3~3} Inside% H (S 2 ) = 1 
and Inacc® H (S 2 ) = 2. By CR Inside®^ 1 (S 2 ) — as the number of intersections is 4. 

5. Winding Number Rule vs EH. The winding number rule (WNR) states 
that the number of times one loops round the point S before reaching the starting 
point on polygon V, decides the number of times whether S lies inside V or not. Thus 



Inside™ NR 



(S) 



0, 



Af loops around S 
zero loops around S 



An analogy of a prison wall shown in figure |5.1b| is taken into account to the explain 
the differences. Figure 5.1b (A) is the intial structure of the prison and then the 
final structure is shown in the part (B) of the same figure. Initially, via the WNR, 
iSi was lying outside and 52 inside the prison wall. Same is the verdict by the new 
method. Next a portion of the prison wall is extended and the final structure looks 
like that in figure 5.1b| (B) . Note that Si and 52 are still outside the new prison 
via the new definition, as the areas in which Si and S 2 lie, are not reachable from 
prison's perspective. This is because two pairs of walls have to be ignored and not 
broken in each case. From this point of view both Si and 52 are outside V, in figure 
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Fig. 5.1: Cases to compare Crossover (CR), Winding Number Rule (WNR) and the 
proposed method (EH). 



5.1b (B). Also, even though WNR = 2 for 52 in new prison in part (B) of the same 
figure, implies the point lies twice inside, it does not make sense. It can be stated 
that if a point lies inside once, then it lies forever. There does not arise the idea of 
point lying inside N times. Thus a point lying inside Af times, is the same as point 
lying inside once. If it does not lie inside, then it won't lie forever. In this way the 
new definitions and the accompanying theorem are definitive in producing a concrete 
answer via means of epigraph-hypograph method. 

6. Conclusion. A theoretically reliable and an analytically correct solution to 
the point in polygon problem is proposed. The proof for the same is given by building 
a relationship between two new concepts of inaccessibility as well as inside. It is 
proved that the solution is good interpretation of inside for both the simple and self 
intersecting polygons, which forms the crux of the manuscript. 
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Fig. 7.1: Query point location in (A) closed and (B) intersecting polygon. 



Appendix. 

7. Algorithm Implementation. The implemented algorithm will be illustrated 
with each step explained with a pictorial representation of result of the execution of 
that particular step. Examples include closed, intersecting and non intersecting poly- 
gons. Figures |7.1| and |7.2| show the polygons with the sample point being tested at 
different locations. 

7.1. Intersecting the V . V is an ordered series of vertices (xi,yi), (x%, i/2),---, 
(x„,y„), starting from (x±,yi) that defines the polygon. Given V, the first step is 
to draw a straight line through the sample point S (xo,yo), such that it intersects 
the polygon at certain points. For simplicity, the horizontal line y — y a is considered 
without loss of generality. 

The intersection points are obtained via computing the values of the Xi n t coordi- 
nates between the y = yo and the straight line, extending from (xi, yi) to (xi+i, yi+i)- 
The slope and the constant of the former is and yo, and that of latter is rrii — 
and Ci = yi^niiXi. Here i and i + 1 are consecutive points on V . Solving the algebraic 
equation between two straight line gives: 



(7.1) 



Ant Vo 



Once an intersecting point with coordinate (x mt , yo) is obtained, the algorithm checks 



rrii 

int 
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Fig. 7.2: Query point location in (A) closed and (B) intersecting polygon. 



if this point lies in between (x,-,yj) and (xj+i, y;+i), on the line. This is achieved via 
affinc combination property in definition |2.3| Three different cases arise depending 
on the slope of the line: 

1. rrii — ±oo : If the edge is a vertical line, the point (x mt ,yo) lies on the line. 
This is because x mt = Xi = 

2. rrii = : If edge is a horizontal line, the intersection point (x mt ,yo) is con- 
sidered to lie outside the range of (xj, yo) and (xi+i, yo)- This is because, if it 
lies within (xj,yo) an d (xi+i,yo); then there are infinitely many points that 
could be considered. To save from randomly selecting any point within the 
given range, it is thus considered that the point (x mt ,yo) outside the range. 
Also, if (xo, yo) lie on a horizontal edge, it is still considered outside the range 
for further processing. 

3. rrii € TZ— {0, ±00} : Finally, this being the simplest case, it is easy to com pute 
whether (x , yo) lie on the line between the given points using definition 



2.3 



Figures [7 . 3| and |7 . 4| show the horizontal line y — yo passing through S and intersecting 
the polygon at different edges. The sample point is indicated via the red arrow while 
the vertices at the intersection of y — yo with edges of the polygon are pointed to 
by the blue arrows. Once the vertices of the intersecting points are computed, they 
are added to the list of pre existing vertices of the polygon under consideration. The 
newly created intersecting vertices are appended in such a manner that the traversal 
order is not affected. 
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Fig. 7.3: Finding interesecting points on the polygon. Sample point indicated via red 
arrow lies in (A) closed and (B) intersecting polygon. Blue arrows point to newly 
found intersection points. The green line depicts y = yo. 



It may happen that some of the newly added vertices with their y coordinate as 
yo are very close to other pre existing vertices in the tolerance range of ±10~ 5 or 
smaller. The algorithm removes the vertices from the list that lie in such close range, 
but keeps the newly added vertices with coordinates (x mt ,yo). Two reasons arise for 
executing this step: 

1. To avoid further computations that may involve floating point precision of 
the order smaller than or equal to ±10~ 5 . 

2. To retain newly appended vertices with coordinates (x mt ,yo) that will be 
later used for searching chains whose epigraph or hypograph may contain S. 

It must be noted that this insertion of vertices does not affect the solution to the PiP 
on paper, but may affect solution practically on a computer due to the floating point 
representation and operations on it. Also, note that the tolerance range is not the 
issue investigated in this work. 

7.2. Decomposition of Polygon into Valid and Invalid Chains. The new 

vertices with coordinates (a;™ 4 , yo) were (j <E {1, m}) and the sample point (xq, yo) 
become form the basis for the next steps. From definition [2T] it is known that a point 
belongs to the epigraph (hypograph) if it lies on or above (below) the graph under 
consideration. 

To use the mentioned properties, the polygon V needs to be decomposed into the 
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Fig. 7.4: Finding intersecting points on the polygon. Sample point indicated via red 
arrow lies out of (A) closed and (B) intersecting polygon. Blue arrows point to the 
newly found intersection points. The green line depicts y — yo- 



chains. These chains would then be tested for convexity or concavity. The algorithm 
achieves this in the following way. A vertex on V, especially with the y coordinate 
being yo is picked up as the starting vertex. A traversal order is chosen randomly and 
is followed until the starting point is reached again. 

Vertices lying on a path between any two consecutive (xl nt ,yo) and (xj nt ,yo) 
points on V, including the intersecting points themselves, form a chain. Thus as 
the traversal is done from one intersecting vertex to the other, the polygon gets 
decomposed into chains, that lie either above or below the horizontal line y = yo- 

In 



Next the chains are tested for validity using the definitions in 2.4 and 2.5 



non mathematical terms, if the starting and ending vertices of a chain are on either 
side of the S on the line y = yo and not on the same side with respect to the sample 
point, then the chain is a valid one. The valid chains are stored with their starting 
and ending vertices along with coordinates of points on the chain. 

Figures |7.5| and |7.6| show the valid and invalid chains pointed by the green and 
blue arrows respectively. The solid and dotted lines also demarcate the chains. The 
solid lines represent the valid chains and the dotted lines represent the invalid chains. 
The red arrow indicates the sample point's location in each of the figures. 

7.3. Chain Intersection. Hitherto, it is known that the x coordinate of S lies 
in the epigraph or the hypograph of the valid chains. To know that the sample point 
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Fig. 7.5: Polygon decomposition into chains. (A) closed and (B) intersecting polygon 
decomposed into chains. Green and blue arrows indicate the valid and invalid chains, 
respectively. Location of query point w.r.t the polygon is shown via the red arrow. 



lies within the polygon, finally it needs to be tested whether the y coordinate of iS 
lies within any two nearest valid chains. The rationale behind doing these steps will 
be elucidated a little later, but before that it is important to define what the nearest 
valid chains mean: 

After the valid chains have been obtained, a similar procedure of intersecting the 
chains is done using the vertical line x — xq. For each chain, defined by a set of 
vertices (xi,yt), points of intersection are computed between the edges of the chain 
and the vertical line. The slope of the straight line joining two vertices on the chain 
is mi = ^***Z^'. and the constant is c, = yi — rniXi. Now, i and i + 1 are consecutive 
points on the valid chain. The value of y coordinate of the point of intersection are 
computed as follows: 



(7.2) y mt = mi x + c t 

Once the coordinate of the intersection point (xo, y lnt ) is obtained, a test is conducted 
to find whether the intersection point lies in between {Xi,yj) and (xj+i, 2/i+i)- This 
is achieved using the affine combination property in definition |2.3| Three cases may 
arise, that need to be considered: 

1. rrii = ±oo : If edge is a vertical line, the intersecting point (xo,y mt ) is 
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Fig. 7.6: Polygon decomposition into chains. (A) closed and (B) intersecting polygon 
decomposed into chains. Green and blue arrows indicate the valid and invalid chains, 
respectively. Location of query point w.r.t the polygon and is shown via the red arrow. 



considered to lie inside the range of (xo,yi) and (xq, Di+i)- This is because, 
if the chain crosses x = Xq many times before going from left of S to the 
right of S or vice versa, then there can be infinitely many points on one chain 
that may be considered as intersection points. That is, it may require the 
algorithm to store many intersection points for just one chain. To avoid the 
presence of many intersection points on a single chain, the algorithm stores 
the vertex of one of the vertical edges in a chain. 

n%i = : If the edge is a horizontal line, the intersecting point (xo,y mt ) lies 
on the line. This is because x mt — xq and y mt = J/j = j/j+i- 
nrii € TZ— {0, ±00} : Finally, this being the simplest case, it is easy to com pute 
whether (xq, y mt ) lie on the line between the given points using definition 2.3 



The pictorial representation of the line x = Xq intersecting the valid chains are shown 
in figures [7~7| and [778] The green arrows indicate the points of intersection on the valid 
chains that are now pointed to by the blue arrows. The sample point is indicated via 
the red arrow. As mentioned earlier, the invalid dotted chains have been removed by 
the algorithm, in its final stage of processing. 

7.4. Point Inclusion Test. Each of the valid chains have a point of intersection 
with x — xq. Next, the algorithm sorts the chains according to the y mt . This is done 
so as to arrange all the intersecting points (xo,y mt ) in the different chains in an 
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Fig. 7.7: Point inclusion test. Valid chains of (A) closed and (B) intersecting polygon 
indicated by blue arrows. Green arrows depict the newly found intersection points 
with the vertical line x = xq. The red arrow shows the point lies inside the polygon. 



ascending order, with the sample point (xq, yo) somewhere in between (if the case is). 

Finally, considering a pair of y mt, s at a time, i.e a pair of chains at a time, it is 
tested whether (xoyo) is an afhne combination of (xq, yl nt ) and (xo, J/} nt ) (for i < j). If 
such a pair is found, then the point lies inside the polygon, else it is outside. The only 
care that the algorithm takes while executing this step is that, the pair of chains or 
the pair of intersecting y coordinates are mutually exclusive. This means that if there 
are n/2 pairs, with n being an even number and y\ , y™ , y™ are the intersecting 
y values in order, then (yi ra *,y| nt ), (y™*, yl™'), (y^i,y„ n *) are mutually exclusive 
in the sense that element of one pair cannot be included in any other pair. 

The fundamental idea behind such a rule is that, in a pair, if the path of traversal 
in a chain is moving from left side of S to the right side (say), then the traversal in 
the other chain must move from right side S to the left. Thus, any pair shall not 
contain chains from any other pair. 

These ideas can be seen in figures |7.7| and |7.8| The green arrows mark the 
intersecting points on the valid chains. The test of affine combination with respect to 
the sample point S, indicates whether the point lies within the polygon or without. 



8. Time Complexity. The computational complexity of the algorithm in terms 
of time needs to be addressed. With n edges, the algorithm takes a constant time of 
G\ to process the first step mentioned in section 7.1 G\ is the constant number of 
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Fig. 7.8: Point inclusion test. Valid chains of (A) closed and (B) intersecting polygon 
indicated by blue and green arrows, respectively. Green arrows also depict the newly 
found intersection points with the vertical line x = xq. The red arrow shows the point 
lies outside the polygon. 



steps used in computing the intersection of the horizontal line with each of the edges 
and deciding whether the intersection point lies on the edge, between the vertices 
of the edge. This process thus takes up C\n steps. The newly found intersection 
points are added to the vertex list of the polygon while some vertices close to the 
intersecting points in the range of ±10~ 5 or less are removed to address the floating 
point precision. This execution step on the whole increases the number of vertices by 
a fraction, say fx were < fx < 1. 

Thus the total number of vertices on the polygon now amount to (1 + /i)n. 
The decomposition of the polygon into different chains requires the processing of all 
(1 + fx)n edges, as has been described in section 7.2 This processing is of the order 
of (1 + fx)n with some constant Ci required for checking if the chains are valid or 
not. 



In section 7.3 as in section 7.1 the points of intersection are computed for the 



vertical line and one of the edges in each of the chains. Since the number of edges 
belonging to the chains is a fraction f% (0 < fi < 1) of the total number of edges 
of the polygon, the execution of this step requires Cxf^i^ + fi) n units of cpu time. 
Finally, after sorting of the chains which requires mlog(m) where m = /2(1 + fx)n 
and checking whether the sample point is an affine combination, requires a constant 
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Fig. 8.1: Check of statistical significance using McNemar's Test for polygons with 
increasing vertices. Red, blue and green lines indicate the significance of P vs Cr, P 
vs Wi and Wi vs Cr, respectively. The black line represents the standard x 2 value of 
3.85 with a p value of 0.05. The vertical axis is the logiQ representation of the test 
values. 



time of C3. This procedure is mentioned in the previous section. 

Summing up the total time of execution, the algorithm works in (C±n + (l + fi)n+ 
C 2 +C 1 f 2 (l+f 1 )n)+C 3 +mlog(m) = (C 1 +l+/ 1 +C 1 / 2 +C 1 / 2 / 1 )n+C 2 +C 3 +mZo 5 (m). 
Let A = (Ci + 1 + ft + C x h + C1/2/1) and B = C 2 + C 3 , then the computational 
time complexity of the algorithm is An + B + mlog(m) — > 0(nlogn). This time in 
computation is the worst case scenario, where all the edges have to be processed in 
for all the four main steps described above. 

9. Results. With the proof that the proposed algorithm gives theoretically re- 
liable results, it would of interest to know how algorithms based on the two widely 
used crossing over and winding number rule concepts, fair. One of the measures of 
fairness is the test of significance of results obtained from the algorithms. 

Artificial polygons with number of vertices in the set 2 l where i € {2,. ..,11} 
were generated. For a particular vertex number, 10000 polygons were generated and 
stored. Each polygon was generated by randomly generating the coordinates and 
joining the vertices in order. The last vertex was finally joined with the first vertex. 
This created a series of intersecting as well as non intersecting polygons. All polygons 
were generated in the bounding box of [0, 1]. 

For each of the 10000 polygons having a particular number of vertices, a random 
test point was generated in the bounding box. McNemar's Test pQ was employed 
to check the statistical significance of one algorithm against another. In a 2 x 2 
contingency table, the McNemar's test gives a statistic similar to the chi squared 
statistic which is formulated as: 



(9.1) 




(noi - n 1Q ) 2 
n i + n w 



(9.2) 




(Ki - nioj - l) 2 
riQi + n w 
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Fig. 9.1: Error rates of Cr and Wi with respect to P. 
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Fig. 9.2: False positive rates of Cr and Wi with respect to P. 



where, noi and nio are the false positives and the false negatives, respectively. Equa- 
tion 9.2 was used as it contains the correction for discontinuity. In this study a \ 2 
value of 3.84 and above was set as a standard to account for the significant difference 
of one algorithm against another. Thus with p < 0.05, it is highly unlikely, that the 
algorithm may be significant from the one that it is being compared with. 

In the present scenario, by the proof of the above theorems, it is known that 
the proposed algorithm give exact results. Thus it is expected that the results 
obtained from the implementation of the same theoretical idea shall ap- 
proach ground truth with a margin of error that is completely due to the 
precision format of the computer. Let (P) be the proposed algorithm, (Cr) the 
Crossing over algorithm and (Wi) the Winding number rule algorithm. A matlab 
version of the crossing over algorithm was adopted from [2J. Matlab's [3] inpolygon 
algorithm was taken as an implementation of the winding number rule algorithm. 

A general analysis is presented relating to the overall behaviour of the algorithms 
apropos to McNemar's Test, precision, recall, false positive rate (Fpr) and error rate, 
as the number of vertices increases. All artificially generated polygons have been 
stored and can be reused with a new set of random test points to generate similar 
results for checking. 
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Fig. 9.3: Precision of Cr and Wi with respect to P. 




Fig. 9.4: Recall of Cr and Wi wrt P. 



10. Discussion. In order to evaluate the degree of fairness of the crossing over 
and the winding number rule based algorithms w.r.t the proposed algorithm, Mc- 
Nemar's test was performed on polygons with increasing number of vertices with as 
sample size of 10000. Final results in figure [83] show the statistical significance in the 
performance of the algorithms against each other. Clearly, it can be seen that the 
results generated by Cr is significantly similar to those generated via P (red graph in 
the figure). While comparing the results of Wi with those of Cr as well as P, it can 
be seen that the results are significantly different as the number of vertices increases 
(blue and green graphs in the figure). 

This apparent statistical difference is due to the fact that if the winding number 
is I ^ 0, then the point lies i times inside the polygon. This is not true with the Cr 
which states that even cross overs imply the that points are outside the polygons. P, 
on the other hand sets a bound on the winding number rule, stating that the if t ^= 0, 
then the point lies inside the polygon if and only if the criterion of having a pair of 
chains (one above and other below the point) is true (via proved theorems above). 
Thus even if the winding number evaluates to say 2, in case of the star polygon for 
the point in figure |7.4| P will always give the result that the point lies outside the 
polygon and not twice inside the polygon. 



22 



INACCESSIBILITY-INSIDE THEOREM FOR POINT IN POLYGON 




Increasing # of vertices (sample size 10000) 

Fig. 10.1: Average time in seconds on a logio scale for polygons with increasing 
number of vertices. Sample size of 10000 for each number of vertices. 



Figure |9.1| shows the accuracy of the results obtained by the Cr vs P and Wi 
vs P. The behaviour of the error rate of Wi with respect to P and Cr, again can be 
attributed to the explanation in the foregoing paragraph. Even though the results of 
the Cr have been shown not to be statistically significant to that of P, with increasing 
number of vertices (on a sample size of 10000), the error rate creeps up. The accuracy 
by itself does not always suffice to give the measure of correctness and thus is aided 
via means of the false positive rate, precision and recall. The graphs for the same 



have been depicted in figures 9.2 9.3 and 9.4 



The false positive rate tells how much one algorithm generated falsely true results 
given that base algorithm had labeled the results as false. Thus comparing results of 
Cr against P and Wi against P, the false positive rate was generated over increasing 
number of vertices and calculated on a sample size of 10000 polygons. The graphs 
suggest that both the Cr and Wi start contributing to the false positives with respect 
to P as the number of vertices increases. Wi also gives greater false positives with 
respect to Cr. Given the results generated by the precision, recall and false positive 
error rates it can be stated that with polygons constituting large number of vertices, 
even Cr deviates from P. The Cr which works on the fundamental idea of drawing a 
semi infinite line from the sample point thus crossing the edges/vertices of polygon 
can give different results depending on where the line passes through. A different line 
drawn through the same sample point may lead to a contrasting result as has been 
discussed in the previous section [4] Another case that happens in Cr is that if the line 
passes through say one edge and a vertex (or two vertices) , then the results based on 
even odd rule, change. It has often been pointed out that such cases are very rare and 
can be dealt by infinitesimally shifting the sample point and the line slightly. First, 
the the idea of being a rare case does not imply that the problem is solved. Secondly, 
the solution of shifting the sample point does work, but again it is argued that would 
it give theoretically correct solution, if the sample point was on another vertex and a 
slight shift would put the point outside the polygon itself. 

With the proof of the proposed solution in the manuscript, it can be shown 
that even if the polygon is intersected by two perpendicular lines passing through the 
sample point at angles 8 and #±90, the lines after rotation to horizontal- vertical frame, 
will unarguably give correct results. Thus in rare cases also, the proposed solution 
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Increasing sample size (polygons with 2048 vertices) 



Fig. 10.2: Cumulative time in seconds on a logio scale for increasing sample size for 
polygons with number of vertices 2048. 



will work and give theoretical ground truths. Coming back to the false positive rates, 
with a large sample, increasing size and complexity of the polygons, the Cr and Wi 
are bound to give false positives with respect to P. The precision and the recall graphs 
in figures [9~3| and [9T4[ suggest the concentration and the retrieval of the results. 

to 



Lastly, the average time in seconds on a log scale has been plotted in figure 10.1 



depict the time consumption by the algorithms based on the concepts of the proposed 
solution, the crossing over and the winding number rule. The proposed solution 
apparently takes more time than the crossing over which in turn consumes more time 
than the winding number. Also, in figure |10.2[ for polygons with the number of the 
vertices 2048, and an increasing sample size, the cumulative time in seconds is plotted 
on the log scale. These graphs suggest that the reliability of the proposed solution 
comes at a computational cost. But apart from that, the correctness of the proposed 
solution is not affected, which forms the kernel of the manuscript. 
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